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Abstract. We investigate Newton's method for complex polynomials 
of arbitrary degree d, normalized so that all their roots are in the unit 
disk. We specify an explicit universal set of starting points, consisting 
of 0(dlog d) points and depending only on d, so that among them 
there are d points that converge very quickly to the d roots: we prove 
that the expected total number of Newton iterations required to find 
all d roots with precision e is 0(d i log 3 d + dlog | loge|), which can be 
further improved to 0(d 2 log 4 d + dlog | log e| ) ; in the worst case possibly 
with near-multiple roots, the complexity is 0(d log d(d+ | log e| )) . The 
arithmetic complexity for all these Newton iterations is the same as the 
number of Newton iterations, up to a factor of logd. 



1. Introduction 

Newton's root-finding method is as old as analysis, and it is known to 
be a very efficient method for locally finding approximate roots of smooth 
equations such as polynomials: once a reasonable approximation to a simple 
root is known, every iteration of the Newton method doubles the number of 
valid digits. However, Newton's method is known to be difficult to under- 
stand as a global dynamical system. For instance, the following difficulties 
may arise, even in the case of a polynomial p in a single complex variable: 

• orbits of the Newton map that get close to zeroes of the derivative 
p' will, under the Newton dynamics, jump near oo and will take a 
long time until they can get close to roots; 

• there may be open sets in C in which the Newton dynamics does not 
converge to any root of p; 

• the boundary of the attracting basins of the various roots may have 
positive measure, resulting in "bad" starting points of positive mea- 
sure; 

• even if almost all starting points in C converge to some root of p, it 
is not clear to find starting points for all roots: it is conceivable that 
some roots can be found only from small sets of starting points (we 
do not want to use deflation: it is numerically unstable unless the 
roots are found in a certain order, and it may destroy specific forms 
of the polynomial that are easy to evaluate); 

• finally, even if all roots are found, no efficient estimates on the re- 
quired number of iterations were known previously. 
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This manuscript addresses all these issues: we specify, for arbitrary polyno- 
mials in a single complex variable, a universal set of starting points (depend- 
ing only on the degree and some normalization) from which all roots of all 
polynomials of given degree are found, and so that the required number of 
iterations (or arithmetic complexity) is of similar magnitude as fast known 
methods of polynomial root-finding: it is 0(d 4 ) in the worst case, and 0{a9) 
or even 0(d 2 ) on average (up to factors of logd). More precisely, we will 
prove the following theorem: 

Theorem 1 (Newton Efficiency). 

For every degree d > 2, let V4 be the set of complex polynomials of degree 
d, normalized so that all roots are in the complex unit disk D. Then there 
is an explicit and finite universal set consisting of 3.33 d log 2 d(l + o(l)) 
points in C with the following property: 

• for every p G Vd, written as p(z) = cJT^z — aj), there are d points 

Z W G S d with N° n (z^) — > aj as n — > 00. 

Given e > 0, let nj G N be so that \Np nj (z^)) — ctj\ < e. Then the required 
number of iterations is bounded as follows. 

• Worst-case complexity: we always have 

Y^njeO (d 4 log 2 d + d 3 log 2 d\ log e|) . 
3 

• Average complexity: if the roots aj of p are all simple and have 
mutual distance at least d~ k for some k G N, then 

(1) J^rij G 0(d 3 log 2 d(logd + k) +dlog|loge|) . 

j 

In particular, if the roots are randomly placed in D, then the expected 
number of iterations is O (<i 3 log 3 d + dlog | log £ | ) - 

Further improvements based on our methods even show that for randomly 
placed roots in D, one can expect ^ ■ rij G 0(d 2 log 4 d + dlog \ loge|) [ABS]. 

This result measures the complexity in terms of Newton iterations. Of 
course, each Newton iteration requires arithmetic complexity d (at least 
the d coefficients of p have to be processed), but the evaluation of a given 
polynomial p of degree d at d different points simultaneously is possible 
with arithmetic complexity 0(d log d) using Fast Fourier Transform methods 
[AHU]. Therefore, the arithmetic complexity differs from the complexity 
in terms of Newton iterations only by a factor of logti: there would be 
little gain to implement the Newton method on a parallel computer because 
the method is intrinsically parallel even for classical computers with von 
Neumann architecture. 

Polynomial Root-Finding. There is an enormous literature on polynomial 
root-finding; see for instance McNamee's surveys [McNl, McN2]. Newton's 
method has been considered difficult to analyze: for instance, Pan [P] writes 
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"Theoretically, the weak point of these algorithms is their heuristic charac- 
ter. . . . Moreover, in spite of intensive effort of many researchers, conver- 
gence of these algorithms has been proved only in the cases where the initial 
point is already close to a zero or where another similar condition is sat- 
isfied." Our approach is not meant to compete with existing (sometimes 
rather elaborated and complicated) algorithms. Our point is to show that 
the classical simple and stable (and elegant) Newton method is far more 
efficient than anticipated; the arithmetic complexity in our main theorem 
in the average case differs from fast known algorithms (such as [P, R]) only 
by a logarithmic factor, and it could be a useful ingredient in more efficient 
algorithms. 

Notation. Throughout this text, we will fix a polynomial p € Vd of degree 
d > 2, and we write p(z) = cT\j(z — oij) and N p (z) = z — p(z)/p'(z). The 
coefficient c cancels for Newton's method and will be omitted. Each root 
oij has its basin Uj C C: this is the set of points that converge to ctj under 
iteration of N p . The immediate basin Uj is the connected component of Uj 
containing the root ctj. As long as we focus attention on a single root, we call 
it q and its immediate basin U. It is well known that U is simply connected 
and unbounded [Pr] (see also [Sh, HSS]). Denote by d\j the distance with 
respect to the unique hyperbolic metric on U with constant curvature — 1. 
Finally, let D r (a) := {z £ C : \z - a\ < r} for a G C and r > 0, and let 
D := -Di(O) be the complex unit disk. 

Overview of the arguments and structure of the paper. The fundamental 
idea of this paper is the area used per iteration step. We consider TVp-orbits 
(z n ) that stay in a certain disk Dr(0) for R > 1 ("i?-central orbits). We 
partition -Dr(O) into domains Sk so that for z G Sk, the nearest root has 
distance approximately 2~ k from z. This will imply that \z — N p (z)\ > I := 
2~ k jd. We will have orbits (z n ) in the immediate basin U of a for which the 
hyperbolic distance du(z n , z n +i) = O(logd). Roughly speaking, Euclidean 
distance bounded below by i and hyperbolic distance bounded above by log d 
means that z n and z n +i can be connected by a hyperbolic geodesic segment 
7 C U that has Euclidean distance at least ij log d from the boundary, so this 
curve "uses up" an area of approximately A n ^(£) > l 2 /\ogd = 4~ k /d 2 logd 
(length times width of the neighborhood of the curve). But is contained 
in the union of d disks of radius 2~ k and with total area at most 7rd4~ k , so 
there is room for no more than (d4r k ) / (A~ k /d? log d) = d?\ogd iterations 
in each Sk (always up to bounded factors). In the worst case, when there 
are multiple or near-multiple roots, we will show that we need to consider 
k < 0(d), hence a total of 0(d A \ogd) iterations is required. If the roots are 
well separated from each other, for instance if they are randomly distributed, 
it turns out that k < log d will usually suffice until the domain of quadratic 
convergence is reached where log | loge| further iterations yield precision e, 
so we need 0(d 3 log 2 d + log | loge|) iterations. All these count the number 
of iterations required to find a single root. But since all roots are competing 
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for the area, the number of iterations to find all roots satisfies the same 
bounds (except that the small log | loge| term acquires a factor d). 

Of course, all these estimates have to be made precise, and we have to 
make sure that the domains of area do no overlap. The paper is structured 
as follows. In Section 2 we introduce R-central orbits: these are orbits 
that never leave the disk Dr(0), and we give an existence criterion for such 
orbits. In Section 3, we construct an explicit finite set of starting points that 
contains, for each root, at least one .R-central orbit (z n ) with du(z n , z n+ \) = 
0(\ogd) as required. In Section 4 we estimate the area needed per single 
iteration step. In Section 5, we patch this together and estimate the area 
needed for each orbit: the main point is to make sure that the pieces of area 
assigned to each orbit point are disjoint; this will introduce another factor 
log d. 

It remains to discuss when to stop the Newton iteration. If the roots 
are well-separated from each other, then the iteration reaches the domain of 
quadratic convergence, and the necessary stopping criterion will be given in 
Section 6. We then bring all arguments together and describe the required 
number of iterations for "good" starting points. The worst case of roots that 
are not well separated, or possibly even multiple, is treated in Section 7. 

In Appendix A, we prove a general lemma on the area of certain neigh- 
borhoods of hyperbolic geodesies in Riemann domains; a major concern is 
to make sure that these neighborhoods do not overlap. 

Note that in general we are not interested in optimizing constant factors. 
At a number of places, we specified explicit constants when it was easy to 
do so, rather than referring to unspecified values; it is certainly possible to 
improve most constants significantly. 

Acknowledgements. I would like to thank especially Todor Bilarev, John 
Hubbard, Malte Lackmann, and Michael Thon for many helpful discussions; 
I am also grateful to Magnus Aspenberg, Walter Bergweiler, Edward Crane, 
Bjorn Gustafsson, Mary Rees, and Steffen Rohde. This research was par- 
tially supported by the EU Research and Training Network CODY, by the 
ESF network HCAA, and by the German Research Council DFG. 

2. Channels and ^-Central Orbits 

After a brief review of the geometry of immediate basins outside of D, our 
main goal in this section is to give a condition on orbits that always stay 
within a certain Euclidean disk around the origin; such orbits will be called 
".R-central". 

Consider the immediate basin U of a root a (see Figure 1); as mentioned 
above, it is simply connected. The geometry of these basins outside of D 
has been studied in [HSS]; in particular, U is unbounded. A channel of U 
is an unbounded connected component of U \ D, and an access to oo of U is 
a homotopy class (with endpoints fixed) of curves in U connecting a to oo. 
Every access to oo is fixed by N p , and so is every channel: if B is a channel 
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Figure 1. The dynamical plane of a Newton map (of a poly- 
nomial of degree 6). The basin boundaries are black, one 
immediate basin U with three channels is shaded, and a fun- 
damental annulus for the Newton dynamics is shown. 

of U, then N P (B) \ D = B. Each channel contains one access to oo, and 
each access to oo defines one channel through which it runs to oo. 

Every channel has an associated modulus: the quotient of B by the dy- 
namics of N p is a conformal annulus A := B/(N p ) with some modulus 
H = n(A) =: n(B). One of the main results in [HSS] is that each root of a 
polynomial of degree d has a finite positive number of channels, and at least 
one of them has modulus at least ir / log d. 

On the set B, we will use three different hyperbolic metrics (all with 
constant curvature —1): ds is the hyperbolic metric on B, while djj is 
the hyperbolic metric on U restricted to B. Finally, the quotient annulus 
A = B/(N p ) has a hyperbolic metric, and the infinitesimal metric on A lifts 
to an infinitesimal metric on B, called dA- Asymptotically near oo, all three 
metrics coincide, and in general we have ds > dA > d\j\ the map N p is 
an isometry on A and a contraction on U, hence dA> djj; moreover, there 
exists a branch iV^ 1 : B — ^ B; this branch contracts ds and preserves dA, 
hence ds > dA- For all z G B with N p (z) E B, we have aU(z, N p (z)) >ir/n 
(the core curve in A has hyperbolic length n/fi, and every simply closed 
curve in A is at least as long as the core curve). 

Lemma 2 (Hyperbolic Distance Across Fundamental Domain). 

Let U be an immediate basin and B C U be a channel of modulus [i. Then 

any w,w 6 B with \w\(d — l)/d > \w\ > 5 satisfy djj(w, w) > 2/5( / u + it). 

Proof. We will use the fact that every \w\ > 1 satisfies \N p (w) — w(d — 
l)/d)\ < l/d [HSS, Lemma 3]. Let w' := N p (w); we have d B (w,w') > 
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(Ia(w,w') > ir/fi, but \w — w'\ < (\w\ + l)/d. Therefore, there are points 
a G dB and w" G [w,w'] (the straight line segment from w to w') with 
\a - w"\ < 2fi(\w\ + l)/-nd; thus \a - w\ < (2^/vr + l)(\w\ + l)/d. 
Suppose first that a G dU. Then 

, , , f lw ~^ dt 

du(W,w) > / —7- ; — ; — — r- 

V ' Jo 2((2n/n + l)(\w\ + l)/d + t) 

1 , / |w; — w\ 

■log 1 + 



2 to V (2^/7r + l)(H + l)/d 



2 & V (W 7r + 1 )(IH + 1 )Av 2 b V 6(2/i/vr + l) 



> 6(2 M /7r+l) = 57T 



2 ( 1 + 6(^Ti)) 24^ + 22^ 5(^ + vr)- 

Now we discuss the case that there is no a G d£7 so that \a — w"\ < 
2/j,(\w\ + Vj/itd for any it;" G [«;, u/]. Then a G <9.B \ dU , i.e., |o| = 1 and 
\w\ — 1 < \a — w | < (2/x/7r + 1) ( | | + 1) /g?, hence 

t / ,|«;| — 1 \ vr/4<i-6\ 7r(2d - 3) 
/i > - df-^ - 1 > 



2 \ \w\ + l J 2 V 6 J 6 

In this case, there is certainly a point a' G 9f7 with |it; — a'\ < \w\ + 1 
(some point a' G D), and we get 

j , „, /•I"'-*! dt 1 / |w-u)|\ 

du[w,w) > / = - log 1 + 

Jo 2(|w;|+l + i) 2 V |w| + 1 J 

> 2 log ( 1 + ^TlJ " 2 g ( + / } > 



2(l + 5/6d) 



2(6d + 5) ' 

Since in this case, /x + 77r/9 > 7r(6d + 5)/18, we get again 

5 5"7T 5-7T 2 

du(w,w) > — — — — ■ > — — — — - > — — — > 



2(6d + 5) 36(^ + 7^/9) 36^ + 33vr 5(/z + tt)' 

□ 

Now we can prove an existence criterion for orbits that, once they reach 
D Pi U, will never leave -Dr(O) for a certain radius R > 0. It is easy to check 
that if \z\ > 1, then |A^(z)| < \z\ (this follows from [HSS, Lemma 3] as cited 
above), so all orbits outside of D move towards D, and control can be lost 
only for orbit points in D (especially near poles of N p ). 

Definition 3 (i?-Central Orbits). 

An orbit (z n ) will be called i?-central if \z n \ < 1 implies \z n /\ < R for all 
n' > n. 
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Proposition 4 (Existence Criterion for i?-Central Orbits). 

Suppose all channels of an immediate basin U have modulus at most li with 
[x > 7r/logd, and z G U has djj(z, N p (z)) < 2ir/[x. Then the orbit of z is 
R-central for R < 5(d/(d - l))rs>r(iogd+i)l . 

Proof. Any two points w, w G U with \w\ (d—l)/d > \w\ > 5 have djj(w, w) > 
2/5(/U + 7r) by Lemma 2, so going in U from radius r > 5 to radius r(d/ (d — 
j^[57r(iogd+i)] fans to cross at least |~57r(logd + 1)] disjoint concentric annuli 
with boundary radii differing by a factor of d/(d — 1), and the hyperbolic 
distance is at least 

[57r(logd+ l)] • > 2^-^-^ > 2 JL > du(z,N p (z)) . 

□ 

Remark. This result provides an upper bound for R that is uniform in 
d: we have R < 5((d - l)/rf) r5vr(io g d+i)l < 5e r57r(io g d+i)l/rf_ ultimately, the 
precise value of i? is not of too large importance as it will enter our estimates 
only logarithmically: we need MgZ with 2 M — 1 > R, and M = 15 works 
for all d, while M = 8 works for d > 10 and M = 4 for R > 60. 

Consider again a channel B C U with modulus /U. The quotient annulus 
A := B/(N p ) has a core curve (the unique simple closed geodesic) of hyper- 
bolic length n/fi. We define the central subannulus of A as the set of points 
z £ A with injectivity radius less than 2n/fi. Let the central subchannel of 
B be the preimage in B (under the quotient map) of the central subannulus: 
this is the set of points z G B with dA(z, N p (z)) < 2n / ' \x. 

Lemma 5 (Central Subchannel). 

If A has modulus \x, then the central subannulus of A is a parallel subannulus 
of A with modulus greater than 2fi/3. 

Proof. The quotient annulus A is conformally equivalent to the horizontal 
strip {z G C : |Im z\ < h} modulo z ~ z + 1, where 2h = fi. The infinitesimal 
hyperbolic metric on the strip is given by 

ds 



2hcos(n\y\/2h) ' 

so the length of the simple closed geodesic in A is tt/ll, and the parallel 
subannulus is the set of points x + iy for which \y\ satisfies a certain upper 
bound. In particular, if cos(7r|y|/2/i) > 1/2, then the injectivity radius is less 
than 2tt/li (the horizontal curves of Euclidean length 1 at imaginary part y 
have hyperbolic lengths less than 2tt/li, and the corresponding geodesies are 
shorter than this, but longer than tt/li). The condition cos(7r|y|/2/i) > 1/2 
is satisfied if \y\ < 2h/3, so the central subannulus has modulus at least 
2/x/3. □ 



We now show where i?-central orbits can be found. 
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Proposition 6 (Large Central Subchannels Have .R-Central Orbits). 
Let again U be the immediate basin of a root a and let B C U be a channel 
of a with maximal modulus. Then all points z G B in the central subchannel 
of B have R-central orbits, for R as in Proposition 4- 

Proof. Let [i be modulus of B. If z is in the central subchannel of B, then 
du(z, N p (z)) < dA(z, N p (z)) < lit I \x. By Proposition 4, the orbit of z is 
.R-central. □ 



3. Good Starting Points 

In this section, we construct a finite set of starting points depending 
only on the degree d (and the normalization of p) so that for each root a of 
p, one of the points z G Sd is in U a , satisfies r = du(z,N p (z)) < 21ogd, and 
so that the orbit of z is i?-central for R as in Proposition 4. 

All we need to do is to specify a finite set of starting points that will 
intersect, for every root, the central subchannel of the channels with largest 
modulus. Since every root has a channel with modulus at least ir / log d, 
it is sufficient to specify a finite set of starting points that intersects all 
subchannels of all channels with moduli at least 27r/31og<i. This can be 
accomplished by the methods in [HSS], so we can now construct an explicit 
point grid (in that paper, we used .2663 log d concentric circles that each 
contain 8.33d log d points; here we use (3/2) as many circles because we want 
to hit the channel within the central subchannel with (2/3) the modulus). 

Definition 7 (Efficient Grid of Starting Points). 

For each degree d, we construct a circular grid Sd of starting points as fol- 
lows. For v = 1, 2, . . . , s = [0.4 log d] , set 



r v := (1 + V2) ( 



d- 1 

d 



(i/-l/2)/2s 



and for each circle around of radius r u , choose [8.33d log d~\ equidistant 
points (independently for all the circles). 

Theorem 8 (Efficient Grid of Starting Points). 

For each degree d, the set Sd contains 3.33<ilog 2 d(l+o(l)) points, and it has 
the following universal property. If p is any complex polynomial, normalized 
so that all its roots are in D, then there are d points in Sd that converge to 
the d roots of p, so that for each root, one of these points is in the central 
subchannel of the channel with largest modulus. The corresponding orbits 
are all R-central. More precisely, if \x is the largest modulus of a channel of 
the root a with immediate basin U, then the corresponding orbit converging 
to a satisfies du(z,N p (z)) < 2ir/[i < 21ogd. 

Proof. The annulus 

V := G C: (1 + y/2)y/{d-l)/d < \z\ < (1 + ^2)} 
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is contained in a fundamental domain of the Newton dynamics by [HSS, 
Lemma 10]. For u = 1, 2, . . . , s = [0.4 log d] , subdivide B into s subannuli 




Since V is contained in a fundamental domain of the dynamics, each 
subchannel of any root with modulus [i intersects V in a quadrilateral of 
modulus at least [/,, and by the Grotzsch inequality, it intersects at least one 
V u in a quadrilateral with modulus at least s[x. Each root has a channel with 
modulus at least 7r/logd, so the central subchannel has modulus at least 
27r/31ogd, and this central subchannel intersects some V u in a quadrilateral 
with modulus at least 2s7r/31og<i > 0.2663-7T, independent of d. In [HSS, 
Section 6], it is shown that [8.3254dlog d] equidistributed points on each of 
these circles will find all quadrilaterals connecting the boundaries of the B y 
with modulus at least 0.2663-7T, as in our case. 

Therefore, the grid Sd intersects all central subchannels of all largest 
channels, and the claim follows. □ 

Remark. The number of starting points of 0(dlog 2 d) from [HSS] has been 
further reduced to 0(<i(loglog<f) 2 ) in [BLS], by using a probabilistic set of 
starting points. This approach could also be used in our case. 

Now we have a good set of starting points leading to i?-central orbits. We 
now proceed to estimate the number of required iterations. 



4. Area per Iteration Step 

In this section, we show that every iteration step "uses up" a certain area 
in the plane; since i?-central orbits are bound to some disk Dr(0), this will 
give an upper bound on the required number of iterations. 

Consider some point z £ U, set r := du(z, N p (z)), and let 7: [0,T] — > U 
be the hyperbolic geodesic connecting z to N p (z), parametrized by Euclidean 
arc length. For t £ [0, T], let rj(t) be the Euclidean distance from j(t) to 
dU, and let X(t) be the Euclidean straight line segment of length r](t) (not 
containing the endpoints) with center at j(t) and perpendicular to 7 at 7(t), 
so that j (to) disconnects X(to) into two open segments of length rj(to)/2. 
All the segments X(to) are disjoint (see Appendix A, Proposition 23). 

For k € N, let Xj,(t) be the restriction of X(t) to lengths at most 2~ fe+1 
(i.e., X/, is the perpendicular line segment to j(t) centered at j(t) and 
extending in both directions for a length of mm(rj(t)/2, 2~ k ) ). Let be 
the area covered by the Xk(t) for t £ [0,T], and let Ak(t) be the analogous 
area for t £ [0, £] for £ < T. This set of course depends on z, so a more 
explicit description of this set would be Ak(£,z). 

Lemma 9 (Area Per Iteration Step). 

We have A k (£) > £ 2 /(2t + 2 k ~ 1 £). 
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Proof. The hyperbolic length of 7 between z and N p (z) satisfies 

1 f T dt 1 f e dt 



2 J v (t) ~ 2 J v (t) ' 

This implies 

rt dt [ e dt f e dt 



< I + / < 2r + 2 fc_1 £ . 



o min 2"*+!) - Jo V (t) J 2-*+* 

Since all Xk(t) are disjoint, it is an easy exercise in elementary differential 
geometry that 



A k {t) = J 2min(r / (t)/2,2- fe ) dt = J min (p(t), 2~ fc+1 ) 



dt 



The Cauchy-Schwarz inequality for the functions -y/min 2 fc+1 ) and 
1 / ^min (r?(i), 2~ fc+1 ) implies 



A fc (^) = ^ min (77(f), 2~ fe+1 ) dt > 



I 

Jo 



min (r/(t),2- fc+1 

> ^ 

- 2r + 2 fc -^' 

Define the following subsets of C, for k G Z: 

S|:=|z£C: min|z - ay| < 2~ fe 



□ 



and 

S fe := L G C: min|z - a,-| G (2-( fe+1 \ 2~ fc ] | = S* k \ S* k+l . 

Every S^ is contained in a union of closed disks of radius 2~ k around all 
the roots, and the S k (appropriately restricted) partition D 2 m_ 1 (0), for 
k > -M. 

Lemma 10 (Distance from A k (£) to Roots). 

If z £ SI and \z — N p (z)\ > i, then each point in A k {l) has Euclidean 
distance at most 2~ k+1 + t from a root of p. 

Proof. Since z G St, the point z has distance at most 2~ k from a root; then 
in A k {t) we consider only a segment of length £ on the geodesic from z to 
N p (z), and each point in A k (l) has distance at most 2~ k from a point on 
this geodesic segment. □ 
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5. Area Along an Orbit 

We continue to consider the immediate basin U of some root a; from 
now on, we will assume this root is simple. Let <p: U — > B be a Riemann 
map with (f(a) = 0; then / := (p o N p o if' 1 : B ->• D is holomorphic with 
/(0) = = /'(0). Hence f(w)/w 2 is still holomorphic and sends D to itself, 
and it follows that |/(u>)| < \w\ 2 . 

Consider the orbit z n = N° n (zo) for a point zo € U and let r := du(zo, z\)\ 
then du{z n , z n +i) < t for all n. (Eventually, we will use r = O(logd).) 

Lemma 11 (Hyperbolic Distance to Root and Newton Dynamics). 

(1) If \p(z n )\ > \(p(zn>)\ > e 1 / 2 - 1 « 0.649, then d v {z n ,z n >) > \n - n'\/2, 

and even do(\<p(z n )\, \(p(z n >\) > \n — n'\/2. 

(2) If \ip{z n )\ < 1/2, then \z n > — a\ < e for all n' > n + log 2 | log 2 e — 5|. 

Proof. If | to | > e 1//2 — 1, then 

d D (HI/MI) > |iog(i-H)-iog(i-|/HI) 

> |log(l - \w\) - log(l - \w\ 2 )\ = log(l + \w\) > 1/2 

(in the first inequality, we used the fact that the hyperbolic distance in D 
exceeds the hyperbolic distance in a half plane containing ED). Now we have 
du(z,N p (z)) = dj$(ip(z),ip{N p (z)) = do(<p(z),f{ip(z)) < dn(\<p(z)\, \f(ip(z)\), 
so every subsequent iterate of z has, when mapped by tp into D, ever smaller 
absolute values so that logarithmic distances grow by 1/2 in each iteration, 
and the first claim follows. 

If \<p(z)\ = \w\ < 1/2, then \ip(N° m (z))\ = \f om (w)\ < 2~ 2 ™ . By the 
Koebe 1/4-theorem, |(i^~ 1 )| / (0) < 8 (there are roots other than a in D, so 
not all of D can be in U). By the Koebe distortion theorem, \N° m (z) — 
a\ < 32 • 2~ 2m , and this is less than e provided m > log 2 | log 2 (e/32)| = 
log 2 |log 2 e-5|. □ 

Define sets A n ^(t) := Ak(£,z n ): these are the sets Ak(£) based at the 
points z n , as defined at the beginning of Section 4. 

Lemma 12 (Disjointness of Areas). 

The sets A n k(£) and A n i y(t) are disjoint if n' — n > 2r + 6 and \<p(z n /)\ > 
e 1 ' 2 -!. 

Proof. If n' > n + 2r + 6, then du(z n , z n >) > r + 4 log 2 by Lemma 11, and 
all points on the geodesic segment from z n to z ra +i have hyperbolic distance 
(in U) at least 4 log 2 from z n r, and since n' > n, all points on the geodesic 
segment from z n to z n+ \ have hyperbolic distance at least 4 log 2 from all 
points on the geodesic segment from z n > to z n '+i (because this is so for their 
images in D under \<p\). Essentially by definition, each point on X(t) has 
hyperbolic distance from the base point j(t) of at most 2 log 2, and the claim 
follows. □ 
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Proposition 13 (Number of Points in S^). 

For any k and any £ > 0, each set S*| contains at most 

(2) 7Td(2' k+1 + £) 2 (2t + 2 k ' 1 £) \2t + 6]£~ 2 
points with \z n — z n+ \\ > £ and \<p(z n )\ > e 1 / 2 — 1. 

Proof. If z n G SI and \z n — z n +i\ > £, then each point in A n ^(£) has Eu- 
clidean distance at most 2~ k+l +£ from some root by Lemma 10, so A n k {£) 
is contained in a set of total area at most Tid{2- k+1 + £) 2 . Each A n>k (£) 
has area at least £ 2 /(2t + 2 k ~ 1 £) by Lemma 9, and by Lemma 12 the sets 
Ai,fc(^) an( i & re disjoint if n' — n > 2r + 6 and |<^(z n /)| > e 1 / 2 — 1. 

Therefore, there can be at most 

7rd(2- k+1 +£) 2 \2t + Q\ _ ird(2- k+1 + l) 2 (2r + 2 k - l £) \2t + 6] 
^ 2 /(2r + 2 fc -^) ~ I 2 
such points, for any choice of £. □ 

Remark. The sets A nk (£) are contained in U, so in the end the various 
orbits in the different immediate basins U a for different roots a will compete 
for the area. The last result can thus be sharpened as follows. For a root a, 
let 

U a ,k{£) ■= {z eU a : \z — ctj\ < 2~ k+1 + £ for some root etj} 

(this is the 2~ k + ^-neighborhood of S%, restricted to U a ). Then the set 
contains at most 

(3) area (U a>k {£)) • (2r + 2 k ~ 1 £) \2t + 6] /£ 2 

points on the orbit (z n ) C U a with \z n — z n +i | > £ and \ip(z n )\ > e 1 / 2 - 1, 
and of course we have 

(4) Yl area (^.fc (*)) ^ ^( 2_fe+1 + ^ • 

i 

Lemma 14 (Newton Displacement and Nearest Root). 

For any z G C, i/te nearest root a satisfies \z — a\ < d\z — N p (z)\. 

Proof. This is easy and well known: 

(5) z-N p (z) = 



hence 



\z-N p (z)\ ^\z-aj\ mf a \z 



□ 



Corollary 15 (Number of Points in S k )- 

For any k, the set S k contains at most 

(6) ^(4d + l) 2 (2rd+l/4)[2T + 6l G 0(d 3 r 2 ) 

points on any orbit (z n ) C ?7 mi/t |<£>(£ n )| > e 1 ^ 2 — 1- 
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Proof. If z n G Sk, then \z n — z n +i| > l/d2 k+1 by Lemma 14, so we use 
t = l/d2 k+1 in Proposition 13 and obtain the estimate 

vrd 3 4 fc+1 (V fe+1 + 2 (2t + 1) \2r + 6] 

= 7r(4d+l) 2 (2rd + l/4)[2r + 6l 
as claimed. □ 

Remark. As before (see (3)), the different orbits have to compete for the 
total area available, and the set Sk can contain at most 

(7) area {U a , k (l/d2 k+1 )) • (2r + l/4d) \2t + 6] d 2 A k+1 

points on the orbit (z n ) C U a with |</?(z n )| > e 1 / 2 — 1. 

6. Uniformly Separated Roots 

Now that we have good bounds on how many iterations any orbit can 
spend within any S k , we have to discuss the possible values of k. Any disk 
Dr(0) is partitioned by Sk H -Dr(O) for k > — log 2 (i? + 1), and we gave an 
upper bound for R, hence a lower bound for k, in Proposition 4. We also 
need an upper bound for k: we need a statement of the form "if z is in the 
immediate basin U a for some root a, and \z — a'\ is small for some root a' , 
then | z — a | is also (reasonably) small". 

We will need two kinds of stopping criteria: a worst-case estimate that 
applies especially when there are multiple or near- multiple roots, and a 
better estimate in case the roots are reasonably well separated from each 
other; this latter concept is related to Smale's "approximate roots". We first 
pay attention to well-separated roots: we say that the roots are 5-separated 
if they are all simple and have mutual distance at least 5. If roots are 
randomly distributed in D, with high probability they will be 5-separated 
with 5 = 0(l/d) (see the remark at the end of this section). Multiple or 
near-multiple roots will be treated in Section 7. 

Lemma 16 (Stopping Criterion). 

(1) If \z — a\ < \z — a'\/2d for all roots a' / a, then the Newton orbit of z 
converges to a. 

(2) If even \z-a\ < \z-a'\/(4d + 3) for alia' / a, then \N on (z)-a\ < e 
for all n > log 2 | log 2 £ — 5|. 

Proof. (1) We may rescale coordinates by an automorphism of C so that 
z = and a = 1; denote the rescaled basin of the root a = 1 again by 
U. We have \z — a\ = 1 and \z — ctj\ > 2d for all aj / a. As in the 
proof of Lemma 14, this implies | Yl aj ^a — a j)\ < {d — l)/2d < 1/2, so 
Yl a - ^/( z ~ a j) 1S contained in D 1 / 2 (— 1), the open disk of radius 1/2 around 
— 1. Thus z — N p {z) £ D 2 /3(— 4/3) (the image of a circle under z i— >■ 1/z is 
a circle, and here we identify it by a diameter), hence N p (z) G D 2 / 3 (4/3), 
thus \N p (z) — a\ < 1 = \z — a\ and the orbit of z converges to a. 
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(2) If \z — a\ < \z — a'\/(Ad + 3), then again we choose coordinates with 
a = 1 and z = 0, so all \a'\ > Ad + 3. All z' £ D := D 2 (l) have \z' - a\ < 
2 < \z' — a'\/2d, so D C U, and we have djj(z,a) < djj(l/2,0), hence 
|i^(z)| < 1/2. The claim thus follows from Lemma 11 (2). □ 

Proposition 17 (Number of Iterations on Orbit, <5-Separated Case). 
Suppose a is a simple root and \a' — a\ > 5 for all roots a' / a. If (z n ) is 
an R-central orbit in U with \zq\ < R < 2 M — 1 and djj(zo, z\) < t, then we 
have \zn — a\ < e for all N at least 

ir(4d + l) 2 (2rd + 1/4) [2r + 6] riog 2 (4d + 4)/ 6 + M + 1] + log 2 | log 2 e - 5| 

G O (d 3 T 2 (logd+ | log 5 1 + M) +log|loge|) . 

Proof. If \z — a\ < S /{Ad + 4), then all roots a' / a satisfy \z — a'\ > 
5-5/(Ad + A) = (4d + 3)(5/(4d + 4), so the orbit of z satisfies the hypothesis 
of Lemma 16 (2) and will be e-close to a after at most log 2 | log 2 e — 5| 
iterations. 

Therefore, choose K £ N with 2~ K < 5/(4d + 4), i.e., K = [log 2 ((4d + 
4) / 5)~\ . We only have to consider the number of iterations that the orbit stays 
in Sk with k < K. Since the orbit is contained within D 2 m_i by hypothesis, 
we have k > —M, so we need to consider k 6 {— M, —M + 1, . . . , K}. 

By Corollary 15, any orbit (z n ) with du(zo,zi) < r has at most 7r(4d + 
l) 2 (2rci+l/4) [2r+6] points within each Sk, so the total number of iterations 
required for the orbit z n is at most 

7r(4d + l) 2 (2rd + 1/4) \2t + 6] riog 2 ((4d + 4)/<5) + M + 1] + log 2 | log 2 e - 5| . 

□ 

Remark. Again, the d roots have to compete for the available area within 
D. If all roots are simple and (^-separated, and there are d orbits, one in 
each immediate basin, that satisfy the hypotheses of Proposition 17, then 
the combined number of iterations required to reach e-precision for all d 
roots is at most 

7r(4(i+l) 2 Urd+^j [2T + 6] 

+d\og 2 | log 2 e - 5 1 

G 0(d 3 r 2 (log(f+|log5| +M) + dlog|loge|) ; 

this is almost the same bound as for a single orbit (each area element can 
be used for only one root), except that the estimate log 2 | log 2 e — 5| (which 
does not involve area) applies for each root separately. 

Theorem 18 (Efficient Grid of Starting Points). 

For each degree d, the set has the following universal property. If p is 
any complex polynomial, normalized so that all its roots are in D, and so 
that all roots are simple and have mutual distance at least 5, then there are 



log 2 



Ad + A 



+ M + 1 
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d points in Sd that converge to the d roots of p, and so that the combined 
number of iterations required to reach e -precision is at most 

n(4d + l) 2 (4d log d + 1 /4) [4 log d + 6] x 

x riog 2 ((4d + 4)/<S) + M + 1] + dlog 2 | log 2 e - 5| 

G O (d 3 log 2 d(log d + | log <5|) +dlog|loge|) . 

Proof. For each root ccj, there is a point G 5^ in the central sub- 
channel of the largest channel of on, and its orbit is .R-central and satis- 
fies du(z^,N p (z^)) < t < 21ogd (Theorem 8). Choose M G N with 
2 M — 1 > R (see the remark after Proposition 4). The claim thus follows 
from Proposition 17 and the remark thereafter. □ 

Remark (Expected Mutual Distance Between Roots). The results in this 
section were under the assumption that all roots were ^-separated for some 
5 > 0. If d roots are placed independently and randomly into D (with 
respect to planar Lebesgue measure), then the mutual distance between any 
two roots is easily seen to be at least of the order 1/d. Theorem 8 thus applies 
and yields a number of iterations of at most 0(d 3 log 3 d + dlog | loge|). 

If not the locations of the roots are chosen randomly, but for instance the 
coefficients, then the roots may no longer be equidistributed with respect 
to area; they tend to accumulate along a circle. In any case, the relation 
between coefficients and roots is algebraic, so the expected mutual distance 5 
between roots is bounded by a power law in d, say 5 > 1/d 13 with some /3 > 1, 
but this still becomes only a constant factor in the number of iterations. 

Remark (Further Improvements). The greatest loss in our estimates is in 
the most basic of our estimates, in Lemma 14: if \z — N p (z)\ < s, then 
\z — a\ < ds for some root a. This bound is sharp only if all roots form a 
single multiple root, and then indeed the distance to the root is multiplied 
by (d— l)/d in each Newton iteration. If the roots are randomly distributed, 
then the bound is much better, and this leads to significant improvements. 
Refining our methods in this direction, the following is shown in [ABS]: If 
the d roots are distributed independently in D and randomly with respect to 
area measure, then the number of iterations for the same grid Sd as before 
is at most 0(d 2 log 4 d + dlog | log with high probability. This improves 
our bound by a factor of d/logd, and it is optimal except for some powers 
of log: if we have d starting points outside of D, then each of them takes 
approximately d iterations to move from any radius r to radius r/e, so all 
d points need 0(d 2 ) iterations even to get close to D. Our methods require 
to place the starting points outside of D, and under this assumption the 
number of iterations is essentially best possible. 

7. Non-Uniformly Separated Roots 

If the roots are not uniformly (5-separated, then they may be multiple, 
and the local rate of converge may be linear rather than quadratic. For our 
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purposes, multiple roots are the same as simple roots at a distance smaller 
than the required precision e. Our previous estimate on the required number 
of iterations diverges essentially with 0(rf 3 | log 5\): this is slow in 5, but 
unbounded. We will now provide a uniform bound on the required number 
of iterations for all polynomials in T>d\ We will assume that all roots are 
simple, but since we do not assume a lower bound on their mutual distance, 
the estimates hold for multiple roots as well, by continuity. The point grid 
that we will use is the same as before. 

Our estimates will be based on the following stopping criterion. 

Lemma 19 (Worst-Case Stopping Criterion). 

If z is in the immediate basin of a roota, then \z — a\ < fd\z — N p (z)\, where 

_ d\d-l) (2d\ 2d _, 

depends only on d and satisfies log 2 fd<2(d—l) + 2 log 2 d. 

Proof. This result is proved in [Schl, Lemma 5], using an iterated "cluster 
of roots" argument. □ 

The following result is the "worst-case" version of Proposition 17. 

Proposition 20 (Number of Iterations on Orbit, Worst Case). 

If (zn) is an R-central orbit in U with du(zo, z±) < r and \zq\ < R < 2 M — I, 

then \zn — a\ < e provided N is at least 

Ti(Ad + l) 2 (2rd + 1/4) \2t + 6] (2{d - 1) + log 2 d - log 2 (e) +M + 1) 

+ log 2 | logs - 5| + 1 
G 0(d 4 T 2 + d 3 r 2 (| log e\ + M) + log | log e|) . 

Proof. We iterate the orbit (z n ) while \z n — z n+ i\ > e/fd and |</?(z n )| > 
e 1 / 2 — 1. If at some time \z n — z n+ \\ < e/fd, then we can stop by Lemma 19, 
and if |(/p(zjv)| < e 1 / 2 — 1, then Lemma 11 (2) applies and only log 2 | log 2 e — 5| 
further iterates are required until e-precision is reached. 

We now estimate how many iterates are necessary until \zn — zjy + i\ < 
e/fd] we may suppose that along the way, we always have |^(z n )| > e 1 / 2 — 1. 

We will use Corollary 15 for k = — M, 0, 1, 2, . . . , K, where K is the 
least integer such that l/d2 K+l < e/f d , i.e., K = [log 2 (/ d /2<fe)] < 2(d - 
1) + \og 2 d — log 2 (2e) + 1. The number of orbit points in Sk is at most 
7r(4d + l) 2 (2rd + 1/4)|2t + 6]. The total number of points in S* K with 
\z n — z n+ \\ > e/fd > l/d2 K+1 satisfies the same bound (Proposition 13). 
Moreover, we have D 2 m_ x (0) C S- M U S- M +i U ■ ■ ■ U S K -2 U • ■ ■ U S K -i U S* K , 
so the total number of iterations with \z n — z n +i| > e/fd is at most K+M+l 
times the number for each S^, and by hypothesis the orbit never leaves the 
disk D 2 m_ 1 . □ 



EFFICIENT GLOBAL NEWTON DYNAMICS 



17 



Theorem 21 (Worst Case Number of Iterations). 

For each degree d, the set Sd constructed in Definition 7 has the following 
universal property. If p is any complex polynomial, normalized so that all 
its roots are in ED, and with simple or multiple roots at arbitrary mutual 
distances, then there are d points in Sd that converge to the d roots of p, and 
so that the combined number of iterations required to reach e-precision is at 
most 

vr(4d + l) 2 (2rd + 1/4) \2t + 6] (2(d - 1) + log 2 d - log 2 (e) + M + l) 

+dlog 2 | loge — 5| + 1 

G O (d 4 log 2 d + d 3 log 2 d\ logeQ . 

Proof. It suffices to prove this for the case that all roots are simple; the case 
of multiple roots follows by continuity. The set Sd intersects the central 
subannulus of the channel with largest modulus of each root in at least one 
point, so the corresponding orbits are i?-central and all their orbit points z 
satisfy d(z,N p (z)) < r := 21ogd. 

By Proposition 20, each of these points needs at most 

vr(4d + l) 2 (2rd + 1/4) \2t + 6] (2(d - 1) + log 2 d - log 2 (2e) + M + 2^ 

+ log 2 | log e — 5 1 + 1 

iterations to be e-close to the corresponding root. Since all roots again have 
to compete for the area within D, the total number of iterations combined 
to get e-close to all d roots, for one starting point per root, satisfies the same 
bound, except that a factor d comes in in the part of the estimate where the 
roots do not compete for area, and this is the term with log 2 | log 2 e| (which 
is subordinate to the d\ loge| term). This proves the claim. □ 

Remark. We believe that this result in the worst case can be improved at 
least by a factor of d: the factor fd in Lemma 19 is exponential in d, and 
the worst case leading to this estimate seems very unrealistic. Even though 
fd enters only logarithmically, log fd still contributes a factor of d. If fd 
could be replaced by a polynomial in d, this would gain a factor of d/logd 
(However, the complexity in e really is | loge|, rather than log | loge|, in the 
presence of multiple roots). 

Appendix A. Geometry of Hyperbolic Geodesics 

In this appendix, we will prove the claim that the line segments X(t) as 
introduced in Section 4 are disjoint. We will repeatedly use the theorem 
[J] that every Euclidean disk D C U is convex with respect to hyperbolic 
geodesics in U. This easily implies that for every geodesic 7, the set 7 n D 
is connected. 

Lemma 22. For every to G R, the Euclidean curvature radius of ^ at 7(^0) 
is at least rj{to)/2. 
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Proof. Let R G (0, oo) be the Euclidean curvature radius of 7 at 7(^0) (if 
R = 00, then we have nothing to show). If R < r](to)/2, then let C be a 
circle of radius between R and r/(to)/2 and tangent to 7 at 7(to), an d so 
that for t / to sufficiently close to t, the point j(t) is in the disk bounded 
by C; call this disk D. But then 7(to) disconnects 7 fl U in contradiction to 
Ahlfors' theorem that disks are hyperbolically convex. □ 

Remark. This bound might well be sharp. (Bjorn Gustafsson observed 
[GS] that it is sharp for domains U C C that may contain the point at 00, 
and Edward Crane observed that it is not far from being sharp for domains 
U = C\R ). 

Recall that for a Riemann domain U C C and a hyperbolic geodesic 
7: R — >■ U parametrized by Euclidean arc length, we defined r/(i) as the 
Euclidean distance of j(t) to dU, and X(t) as the straight line segment 
(without endpoints) of length rj(t) with center at j(t) that intersects 7 at 
j(t) in a right angle. 

Proposition 23. All X(t) are disjoint. 

Proof. (0) Suppose X(to) and X(ti) intersect; without loss of generality, 
suppose that ry(io) > v(ti) an d ti > to- Let Dq be the open disk centered at 
7 (to) and with radius 77 (to), and let C\ and C2 be the two circles of radius 
w(to)/2 tangent to 7' (to); then both circles are tangent (from the inside) to 
ODq, and X(to) is exactly the open straight line segment connecting their 
centers. Let D\ and D2 be the two open disks bounded by C\ and C 2 , and 
let £ be the straight line through their centers. Without loss of generality, 
we may assume that X(to) and X{t\) intersect within D±. 

(1) First observe that the geodesic 7 is disjoint from D\ and D2: if there 
is some point 7^2) G Di, say, then consider the disk D[ C D\ so that 
dD[ n &Di = {7(t )} and 7(t 2 ) G dD[. Then d[ C U and the radius of 
D[ is smaller than the radius of D±, and hence smaller than the radius of 
curvature of 7 at 7(to) by Lemma 22, so 7(t) is not in D l for t near to, 
except that 7(to) G dD[. But 7^2) G dD[, and this contradicts Ahlfors' 
hyperbolic disks theorem. 

(2) Let E be the convex hull of D1UD2. We claim that jdE is connected. 
Indeed, suppose 7(t 2 ) E, but 7^3) G E, with to < t 2 < t3, say. Then 
there is a disk D' with radius r](to)/2 and with center on X(to), and so that 
7(t3) G D ; it satisfies D C U. But since 7(to) G D , it follows that 7 n D 
is not connected, a contradiction. 

(3) Our next claim is that along "yOE, no tangent vector is perpendicular 
to 7' (to); this means that, if we call the direction of the tangent vector 7' (to) 
"horizontal" , then 7 n E is a graph over the horizontal direction. For a proof 
by contradiction, suppose there is a t2 > to, say, with 7(t 2 ) G E and 7' (£2) 
perpendicular to 7' (to)- Let and D' 2 be the two disks with centers on £ 
and with radii r)(t )/2 and so that 7(t 2 ) G dD[ndD' 2 . Since 7(t 2 ) Dil)D 2 , 
it follows that 7(t ) G D[ n -D 2 > an d hence that D\ C E for i = 1, 2. One of 
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Figure 2. Illustration of the proof of Proposition 23. The 
two disks Di and D2 are tangent to j(t) at t = to, the set 
E is the convex hull of D\ and Di- Within E, the curve 7 
is a single graph over the horizontal direction. The disk D 
(marked by a thin boundary circle) is a vertical translate of 
D\ so that 7(ti) G dD. The fact that 7 must leave D at 
7(ti) forces the slope of 7 at t\ to be no larger than the slope 
of dD at that point, and this tilts X(t\) so that its distance 
to I is greater than the radius of D\. 

the two disks D[ and D' 2 thus has the property that 7^2 + £) G ^ for small 
e > 0, but not for small e < 0. Since 7(io) G ^ and to < *2 — £ < this 
contradicts Ahlfors' theorem once again. 

(4) Now consider the point 7(^1) and let D be the unique disk of radius 
rj(to)/2 with center on i and so that 7(<i) G 3D (this leaves two choices, and 
we take the disk with center closest to the center of D\). Since 7(^1) G" Z?i, 
the center of D is strictly between the centers of D\ and D2, and hence 
7 (to) G .D. By Ahlfors' theorem again, we have j([to, h]) C D, and 7(f) G" D 
for t > ti. 

Since 7 must leave D in the direction of increasing t, the tangent vector 
7'(ti) must either be parallel to the tangent vector of dD at 7(^1), or its 
slope must be smaller. If they are parallel, then the distance from 7(ti) 
to £ along X(t\) is exactly r](to)/2 (the radius of D), while the length of 
X(ti) is r/(ii)/2 < 7?(to)/2 (from the center point 7(<i) in both directions), 
so X(ti) cannot intersect X(to). If the slope of "y'(ti) is smaller than that 
of the circle, then the distance from j(ti) to I in the direction of X(t\) is 
even greater. Therefore, X(to) and X(ti) cannot intersect. □ 
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